This is an updated version of deberta-v2-xxlarge that implements the AutoModelForCausalLM class, enabling text generation. It fixes three issues in the original implementation, including the output embedding weight names, enhanced mask decoder implementation, and position embedding truncation handling.
Large Language Model
Transformers English